Journal article
Automatically Determining the Number of Clusters in Unlabeled Data Sets
Liang Wang, Christopher Leckie, Kotagiri Ramamohanarao, James Bezdek
IEEE Transactions on Knowledge and Data Engineering | Institute of Electrical and Electronics Engineers | Published : 2009
Abstract
One of the major problems in cluster analysis is the determination of the number of clusters in unlabeled data, which is a basic input for most clustering algorithms. In this paper, we investigate a new method called Dark Block Extraction (DBE) for automatically estimating the number of clusters in unlabeled data sets, which is based on an existing algorithm for Visual Assessment of Cluster Tendency (VAT) of a data set, using several common image and signal processing techniques. Its basic steps include 1) generating a VAT image of an input dissimilarity matrix, 2) performing image segmentation on the VAT image to obtain a binary image, followed by directional morphological filtering, 3) app..
View full abstractGrants
Awarded by Australian Research Council (ARC)
Funding Acknowledgements
The authors would like to express their thanks to Associate Editor Dr. Domenico Talia and the anonymous reviewers for their insightful comments that helped improve this paper. This work is partially supported by the Australian Research Council (ARC) Discovery Project (Grant DP0663196).